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Abstract. We report on the results of a detailed analysis of the X-ray spectral properties of a large sample of sources detected 
serendipitously with the XMM-Newton observatory in 25 selected fields, for which optical identification is in progress. The 
survey covers a total solid angle of ~ 3.5 deg 2 and contains 1137 sources with ~ 10~ 15 < So.5-10 < 10~ 12 ergcnr 2 s -1 with 
good enough spectral quality as to perform a detailed X-ray spectral analysis of each individual object. We find evidence for 
hardening of the average X-ray spectra of the sources towards fainter fluxes, and we interpret this as indicating a higher degree 
of photoelectric absorption amongst the fainter population. Absorption is detected at 95% confidence in 20% of the sources, 
but it could certainly be present in many other sources below our detection capabilities. For Broad Line AGNs (BLAGNs), 
we detect absorption in ~ 10% of the sources with column densities in the range 10 21 - 10 22 cnT 2 . The fraction of absorbed 
Narrow Emission Line galaxies (NELGs, most with intrinsic X-ray luminosities > 10 43 erg s , and therefore classified as type 
2 AGNs) is significantly higher (40%), with a hint of moderately higher columns. After correcting for absorption, we do not 
find evidence for a redshift evolution of the underlying power law index of BLAGNs, which stays roughly constant at T ~ 1.9, 
with intrinsic dispersion of 0.4. A small fraction (~ 7%) of BLAGNs and NELGs require the presence of a soft excess, that we 
model as a black body with temperature ranging from 0.1 to 0.3 keV. Comparing our results on absorption to popular X-ray 
background synthesis models, we find absorption in only ~ 40% of the sources expected. This is due to a deficiency of heavily 
absorbed sources (with Nh ~ 10 22 - 10 24 cnT 2 ) in our sample in comparison with the models. We therefore conclude that the 
synthesis models require some revision in their specific parameters. 

Key words. X-rays: general, surveys, galaxies: active 



1. Introduction Chandra and XMM-Newton have reached fluxes 100 times 

fainter, resolving ~ 50 - 70% of the hard XRB (Mushotzky 

Extensive studies with different satellites have been performed et al ^Mfr Rarger et al. HH] Fiore et al HSU Giacconi et 

during the last decades to unveil the origin of the cosmic X- al rj^jr] Hasinger et al . rjftJT] Tozzi et alHOHIl Baku et al. 
ray background radiation (XRB). In the 0.5-2 keV band, the Worsley et aL |2TO . The fraction of resolved X-ray 

ROSAT deep surveys in the Lockman Hole (Hasinger et al. background has recent i y been increased to ~ 90% in the 2-10 

1998 1 resolved ~ 70 - 80% of the XRB into discrete sources keV and „ 94% in the q.5-2 keV bands by Cowie et al. lMHI 

down to a flux limit of ~ 10 15 erg cm 2 s 1 . The deepest and Moretti et al.l2003l respectively. 
Chandra observations have now resolved ~ 90% of the soft 

XRB reaching a flux limit of ~ 2 x 10~ 16 erg cnr 2 s" 1 (Moretti 0ptical follow . up of the faint X-ray sources has revealed 
et al.| 2003 ). tnat tne bu j k G f tne XRB is made up of type 1 and type 2 
In the hard X-ray sky, important contributions to the popu- AGNs. The results of optical identification programmes sug- 
lation studies were performed by the AS CA (G eorgan topoulos gest that the fractional contribution of type 2 AGNs is becom- 
et al. 1 19971 Cagnoni, Delia Ceca & Maccacaro[l99 8| Boy le et ing increasingly important towards fainter fluxes (e.g. Fiore et 
al. fl998l Ueda et al. fT998TT999l De lia Ce ca et al.p999i and al. l2003l . 
BeppoSAX (Giommi, Petri & Fiore UMi\ satellites. They re- 
solved ~ 30% of the 2-10 keV hard XRB (Ueda et al 119981 Now that the discrete origin of the XRB has been confirmed 
Fiore et al. 1200 It down to fluxes of 5 x 10~ 14 ergcnT 2 s _1 . all over the X-ray energy band (0.1-10 keV), efforts are being 
concentrated on the study of the source populations that make 
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is still controversial (Barger et al. 120021 Mainieri et al. 2002 
Fiore et al. 2003] Ueda et al. l2003l . 

At the faintest X-ray fluxes an important contribution to 
the XRB emission is expected to come from type 2 AGN 
(Setti& Woltier [T989l Gilli et al. l200Tl Comastri et al. [T995V 
This has been confirmed observationally (Mainieri et al. 2002 1. 
The distribution of intrinsic absorption among the sources has 
only been measured observationally for the brightest sources 
at low redshifts, but is still unknown at high redshifts. Recent 
Chandra and XMM-Newton studies have revealed that the 
widely accepted correlation between the optical obscuration 
and X-ray absorption (type 1 AGN being unabsorbed and type 
2 AGN being absorbed) is not always true. Several cases of 
type 1 AGN suffering from significant X-ray absorption have 
been found (Mittaz et al. 119991 Fiore et al. 120011 Page et al. 
I200TI Schartel et al. I200TI Tozzi et al. I200T1 Mainieri et al. 
120021 Brusa et al. l2003l Page et al. l2003l Carrera et al.|2004 
Perola et al. 2004 1, as well as Seyfert 2 galaxies unabsorbed in 
X-rays (Pappa et al. 12001 Panessa et al. 2002, Barcons et al. 

USD- 

The motivation for this work is to exploit the large col- 
lecting area and wide field of view (FOV) of XMM-Newton to 
perform a detailed spectral analysis of X-ray selected sources 
down to medium-faint fluxes. This study will allow us to put 
strong observational constraints on the average X-ray spec- 
tral properties of the source population that dominates the 
X-ray emission at the flux level of our survey, typically ~ 
10~ 14 ergcm" 2 s _1 in the 0.5-10 keV energy band. Therefore 
we have conducted a large (total solid angle of ~ 3.5 deg 2 ) 
and medium to deep (0.5-10 keV flux in the range ~ 10~ 15 - 
10" 12 erg cnT 2 s~') survey of sources detected with the XMM- 
Newton observatory in the 0.2-12 keV energy band. We take 
advantage of an ongoing identification effort of sources in the 
fields under study, which enable us to obtain spectral proper- 
ties of representative samples of identified sources. This work 
complements an earlier study of the X-ray spectral proper- 
ties of a large sample of sources at similar fluxes, but where 
the count rates detected in the XMM-Newton standard energy 
bands (0.2-0.5, 0.5-2, 2-4.5, 4.5-7.5, 7.5-12 keV) were used 
to analyse the broad band X-ray spectral shape of the sources 
(Mateos et al l2003> . 

The paper is organised as follows: Sect. [2]describes the X- 
ray data and the current status of the identifications; Sect. [3] 
describes the extraction of the X-ray spectral products; Sect.|4] 
describes the results of the spectral fitting; Sect. [5] discusses 
the average spectral properties of our sources; the X-ray spec- 
tral properties of sources identified as ALGs are described 
in Sect. |6j Sect. compares our observational results with 
specific XRB synthesis models; the results of our analysis 
are summarised in Sect. [8] Throughout this paper we have 
adopted the currently favoured concordance cosmology param- 
eters H = 70kms~ 1 Mpc~ 1 , Q„, = 0.3 and Q. h = 0.7. The 
luminosity values that we use in the paper are the intrinsic 
luminosities of the objects corrected for absorption. All one- 
parameter uncertainties are computed with a delta chi-squared 
of 2.706, equivalent to 90% confidence region for a single pa- 
rameter. 



2. Observations 

In this study we use 25 observations selected from the public 
XMM-Newton data archive. They cover a total solid angle of 
~3.5 deg 2 . To obtain a clean extragalactic sample of objects 
and to maximise the survey efficiency, the X-ray observations 
were selected according to the following criteria: sky positions 
at high galactic latitudes (\b\ > 20°); observations for which we 
had data from the European Photon Imaging Camera (EPIC)- 
pn detector in FULL-FRAME MODE (i.e. the EPIC-pn full 
FOV covered). We avoided fields containing bright point or ex- 
tended targets where the wings of the target can reduce sub- 
stantially the area of the survey covered. The current analysis 
has been performed within the framework of the tasks of the 
XMM-Newton Survey Science Centre 1 . 

The X-ray fields are listed in Table \l\ together with their 
observational details. 

2.1. X-ray Data reduction 

All the observations were processed through the XMM-Newton 
pipeline processing system that uses a suite of tasks created 
specifically for the analysis of XMM-Newton data, namely the 
Science Analysis System (Gabriel et al. l2004> . The current ver- 
sion at the time of the analysis being v5.3.3. The pipeline pro- 
cess provides calibrated event files, images and exposure maps 
in five standard X-ray energy bands (0.2-0.5, 0.5-2, 2-4.5, 4.5- 
7.5 and 7.5-12 keV) covering the energy range where the data is 
best calibrated. The X-ray images are created with a pixel size 
of 4"x 4"and cover a FOV of ~ 30' diameter. The exposure 
maps give information on the spatial efficiency of each camera 
including the energy dependent mirror vignetting. The pipeline 
products also provide source lists for each observation and for 
each EPIC camera (pn, MOS1 and MOS2). 

The source detection is run independently for each of the 
EPIC detectors, simultaneously on the five energy bands de- 
fined above. The area of the images where the sources are 
searched for is defined by the SAS task emask, that creates 
a map where CCD gaps and bad pixels/columns are excluded. 
The source detection process starts with the task eboxdetect 
(local mode) that performs a sliding box cell detection of 
sources with a minimum likelihood of detection of 8. This pro- 
vides an input list of source positions masked out by the SAS 
task esplinemap to build background maps for each detector 
and energy band. Eboxdetect (map mode) is run again, but 
this time the background is taken from the maps created by 
esplinemap. This improves significantly the detection sensi- 
tivity. Finally the task emldetect performs maximum likeli- 
hood Point Spread Function (PSF) 2 fits to the distribution of 
counts of the sources detected by eboxdetect, obtaining for 
each object the total (0.2-12 keV) count rate and the likelihood 
of detections (total and on each energy band). For a description 
of the application of Maximum Likelihood analysis to detec- 
tion of sources see Cruddace et al. 119871 The sources with a 
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Field 


R.A. 
(72000) 


Dec 
(72000) 


b a 
(dea) 


(10 20 cm" 


Rev./Obs. id 

- 2 ) 


Ml 


Filter b 
M2 




Ml 


GTI c (ksec) 
M2 




A2690 


00 


: 00 : 


30.3 


-25 


: 07 : 


30.0 


-78.90 


1.84 


0088/0125310101 


M 


M 


M 


19 


19 


21 


CL0016 + 16 


aa 
UO 


: 18 : 


1 1 a 
33. U 


i 1 1~ 
+ lo 


: 2d : 


1 O A 

18. 


-45.54 


4.07 


0194/0111000101 


M 


M 


M 


31 


31 


29 




















0194/0111000201 


M 


M 


M 


5 


5 


3 


G133 - 69 pos 2 


n i 
01 


. A/i . 

: 04 : 


00.0 


A£ 

-06 


. /I O . 

: 42 : 


AA A 

00.0 


-69.35 


5.19 


0104/0112650501 


Th 


M 


Th 


23 


23 


18 


G133 - 69 pos 1 


a i 


. A/1 . 

. 04 : 


o a a 
24.0 


— (JO 


: 24 : 


AA A 
00.0 


-68.68 


5.20 


0188/0112650401 


Th 


M 


Th 


23 


23 


20 


SDS - lb 


02 


: 18 : 


00.0 


-05 


: 00 : 


00.0 


-59.75 


2.47 


0118/0112370101 


Th 


Th 


Th 


47 


48 


40 




















0119/0112371001 


Th 


Th 


Th 


51 


51 


43 


SDS-3 


AO 

02 


.10. 

: 18 : 


/ion 
48.0 


A/1 

-04 


. Q A . 

: 39 : 


1 1 A 

13.0 


-59.35 


2.54 


0121/0112370401 


Th 


Th 


Th 


21 


21 


15 




















0121/0112371501 


Th 


Th 


Th 


7 


7 


4 


SDS - 2 


02 


: 19 : 


36.0 


-05 


: 00 : 


00.0 


-58.91 


2.54 


0120/0112370301 


Th 


Th 


Th 


50 


50 


40 


Mkn3 


AA 


• 1 ^ • 

. 13 . 


3u.3 


i n 1 
+ / 1 


. AO . 

. U2 . 


1 £ A 

13. U 


22.72 


8.82 


0158/0111220201 


M 


M 


M 


54 


54 


44 


MS0737 


0/ 


. A A . 

: 44 : 


A/1 C 

04.3 


i T/1 

+ /4 


: 33 : 


4y.3 


29.57 


3.51 


0063/0123100101 


Th 


Th 


Th 


39 


39 


20 




















0063/0123100201 


Th 


Th 


Th 


19 


19 


20 


CL0939 + 472 


aa 
09 


. A 1 . 

: 43 : 


AA A 
00.0 


i /i 

+40 


: 39 : 


QA A 
30.0 


48.88 


1.24 


0167/0106460101 


Th 


Th 


Th 


49 


49 


43 


S5 0836 + 716 


no 
08 


.At. 

: 41 : 


O A A 

24.0 


i -7A 
+ /O 


: 33 : 


A 1 A 

41.0 


34.43 


2.98 


0246/0112620101 


M 


M 


M 


4 


4 


25 


B2 1028 + 31 


10 


: 30 : 


59.1 


+31 


: 02 : 


56.0 


59.79 


1.94 


0182/0102040301 


Th 


Tck 


Th 


26 


26 


23 


B2 1128 + 31 


i i 
1 1 


: 31 : 


AA A 

09.4 


i Q 1 

+31 


. 1 A . 

: 14 : 


A-7 A 
0/.0 


72.03 


2.00 


0175/0102040201 


Th 


Tck 


Th 


19 


23 


13 


Mkn205 


I 2 


. 21 . 


A A A 

44. U 


+ /3 


. lo . 


in a 
3 /.U 


41.67 


3.02 


0075/0124110101 


M 


M 


M 


50 


50 


37 


MS1229.2 + 6430 


1 o 

12 


. Q 1 . 

: 31 : 


A 

32.0 


1 £/1 

+04 


. 1 A . 

: 14 : 


O 1 A 

21.0 


53.05 


1.98 


0082/0124900101 


Th 


Th 


Th 


34 


34 


29 


HD 117555 


13 


. qa . 

: 30 : 


/I H A 

4/.0 


i O /I 

+24 


: 13 : 


39.0 


80.67 


1.16 


0199/0100240101 


M 


M 


M 


29 


29 


25 




















0205/0100240201 


M 


M 


M 


36 


36 


33 


UZ Lib 


1 c 

1 J 


. 32 . 


A 

23. U 


no 

—us 


. 32 . 


A^ A 

U3.U 


36.57 


8.97 


0210/0100240801 


M 


M 


M 


23 


23 


23 


PKS 2126 - 158 


21 


: 29 : 


12.2 


-15 


: 38 : 


41.0 


-42.39 


5.00 


0255/0103060101 


M 


M 


M 


22 


22 


16 


PKS 2135 - 147 


21 


: 37 : 


45.2 


-14 


: 32 : 


55.4 


-43.85 


4.70 


0254/0092850201 


M 


M 


M 


15 


16 


28 


MS2137 - 23 


21 


: 40 : 


15.2 


-23 


: 39 : 


41.0 


-47.50 


3.50 


0254/0008830101 


Th 


Th 


Th 


13 


14 


10 


PB5062 


22 


: 05 : 


09.8 


-01 


: 55 : 


18.0 


-43.28 


6.17 


0267/0012440301 


Th 


Th 


Th 


31 


31 


28 


LBQS 2212 - 1759 


22 


: 15 : 


31.6 


-17 


: 44 : 


05.7 


-52.92 


2.39 


0355/0106660401 


Th 


Th 




33 


33 






















0355/0106660501 


Th 


Th 


Th 


8 


5 


8 




















0356/0106660601 


Th 


Th 


Th 


103 


103 


91 


PHL 5200 


22 


: 28 : 


30.4 


-05 


: 18 : 


55.0 


-49.97 


5.26 


0269/0100440101 


Tck 


Tck 


Tck 


45 


45 


43 


IRAS22491 - 18 


22 


: 51 : 


49.4 


-17 


: 52 : 


23.2 


-61.42 


2.71 


0267/0081340901 


M 


M 


M 


22 


22 


20 


EQ Peg 


23 


: 31 : 


50.0 


+ 19 


: 56 : 


17.0 


-39.14 


4.25 


0107/0112880301 


Tck 


Tck 


Tck 


14 


14 


12 



a Galactic latitude 

b Blocking filters: Th: Thin at 40nm Al ; M: Medium at 80nm A 1 ; Tck: Thick 

at200nm Al 
c Good time intervals after removal of flares 



total detection likelihood higher than 10 are included in the fi- 
nal list. 

2.2. X-ray source list 

The geometrical shadowing of half of the sky X-ray emission 
received by the MOS1 and MOS2 detectors (deviated to RGS 
gratings) makes the EPIC-pn camera a factor of two more sen- 
sitive than the MOS cameras overall. Therefore, to optimise the 
sensitivity of the survey, we used the pipeline EPIC-pn source 
lists. In the cases where we had more than one observation for 
a field we used the source list of the deepest exposure. 

We defined circular regions (with radius between 16 to 160 
arcsec), to exclude from the images the areas contaminated 
by the emission of the targets. For the objects detected near 



CCD gaps the uncertainties in their coordinates and flux may be 
larger than the statistical uncertainties as given by the pipeline 
products. In order to get rid of these sources with somewhat un- 
certain source parameters, we have excluded zones near CCD 
gaps, widening them in size up to the radius that contains 80% 
of the PSF on each detector point. We then used these masks 
to remove from the source list all the objects detected close to 
CCD gaps. For each field and observation we have examined 
the images searching for spurious sources, like detections in 
hot pixels or bright segments not removed by the pipeline pro- 
cess. We have checked carefully detections close to the tails of 
bright and/or extended objects, that in most cases turned out to 
be spurious. The visual screening process removed ~ 10% of 
the objects from the original list. The total number of sources 
detected within these specifications was 2145. 
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2.3. Source detection sensitivity 

The variety of data that we have used in our study, make it dif- 
ficult to calculate the source detection sensitivity as a function 
of the X-ray flux, because in principle it depends on the expo- 
sure times of the fields and their Galactic absorbing columns, 
N H (Zamorani et al. fT988l 

The XMM-Newton EPIC-pn detector is sensitive to X-ray 
photons at energies ranging from ~ 0.2 to ~ 12 keV. The sen- 
sitivity is maximum between 0.5 and 4.5 keV, where it is al- 
most independent of energy, i.e., objects with different spectral 
shapes are detected with similar efficiency. Outside this energy 
interval, the sensitivity is a strong function of energy. 

Because the band of detection of photons is wide, we do not 
expect the use of fields with different N^ al (ranging from 1 to 

9 x 10 20 cm~ 2 ) to introduce any significant bias (eg. detections 
in the fields with the higher values of Galaxy absorption biased 
towards hard 3 spectrum sources) in the broad band spectral 
properties of our objects at the flux limits of our survey. 

However, we can see in Tabled that the exposure times 
vary significantly from field to field. At 0.5-10 keV fluxes be- 
low ~ 10~ 14 erg cirr 1 s , where only a few fields contribute 
to the source list, we expect our survey to be biased against 
objects with spectra peaking outside the 0.5-4.5 keV energy 
range, for example highly obscured AGNs. 

To build our source lists, the XMM-Newton source de- 
tection algorithm was run simultaneously in the five stan- 
dard XMM-Newton energy bands. Therefore we have analysed 
sources detected in at least one of these energy bands. This 
might be important, because the sensitivity varies significantly 
from band to band, and hence, different source populations may 
contribute at different flux levels. 

In order to calculate the source detection efficiency func- 
tion, we have carried out simulations. A detailed description of 
how we obtained this selection function is given in appendixlAl 
There is indeed a mild dependence of the selection function on 
the spectral slope F at any given flux, and we will correct for 
this effect. 

Fig- n shows the distribution of our sources in bins of 0.5- 

10 keV flux. The fluxes were obtained from the best fit model 
of each object (see Sect, 15.41 ). The selection of fields with a 
range of exposure times results in a gradual reduction of the 
sky coverage as we go to fainter fluxes, and is responsible for 
broadening the flux distribution below ~ 10~ 14 erg crrr 2 s _1 . 
Our sample is likely to be representative of the dominant X-ray 
source population down to a flux of ~ 10~ 14 erg crrr 2 s _1 . For 
comparison we have also plotted in Fig. ^ the distribution of 
flux for the sources with optical spectroscopic identifications. 

The goal of the present paper is not to produce complete 
source lists to build up source counts or luminosity functions 
(which will be the subject of a forthcoming paper), but rather 
to study the X-ray spectral properties of representative samples 
within the medium flux range, where it is known that ~ 50% of 
the 0.5-2 keV (and ~ 40% of the 2-8 keV, see Bauer et al. 2004) 
accretion power in the Universe is produced. For this purpose, 

3 We use the term "soft spectra" when the measured spectral photon 
index, T, is found to be ~2, the typical value for unabsorbed AGNs. 
We consider a spectra to be hard if T <1.5 



Table 2. Breakdown of the sources with optical spectroscopic 
identifications. (BLAGN: broad line AGNs; NELG: narrow 
emitting line galaxies; ALG: absorption line galaxies) 



Object class 


N a 


N good b 


BLAGN 


149 


141 


NELG 


32 


29 


ALG 


10 


7 


Stars 


40 


32 


BL Lac 


1 


1 


Total 


232 


210 



a Number of sources identified 

b Number of sources identified that fulfil the quality thresholds ap- 
plied to the spectra (see Sect. [3} 



none of the above possible biases in the source list has any 
significant impact on our results, after appropriate corrections 
are applied. 

2.4. Optical spectroscopic identifications 

The selected fields are being followed-up in the optical band 
as part of the XMM-Newton Survey Science Centre XID iden- 
tification programme (Watson et al. 2001 ). The observing time 
was awarded by a project named AXIS 4 ("An XMM-Newton 
International Survey", Barcons et al 2002). The optical obser- 
vations consist of high quality multicolour imaging and opti- 
cal spectra of the brightest sources (the optical observations 
and spectroscopic identifications will be described in detail in 
a forthcoming paper). This is an ongoing process, whose first 
results were presented in Watson et al. 2001 and Barcons et al. 
120021 The full list of identified sources will be made available 
via the XMM-Newton Science Archive when completed down 
to a 0.5-4.5 keV flux of 2 x 10~ 14 erg cirT 2 s _I . Two identifica- 
tions in the A2690 field were from Piconcelli et al. 2002] For 
the time being, ~ 1 1 % (232) of the sources in our sample have 
been identified via optical spectroscopy. We have optical iden- 
tifications within all fields except one (MS 1229), the fraction 
of identified sources per field ranging from ~ 1.4% to ~ 39%. 

The objects were classified according to their optical emis- 
sion line properties. Sources with spectra showing strong 
broad emission lines (FWHM > 1000 km s~') were classified 
as Broad Line AGNs (BLAGNs). If only narrow emission lines 
were seen (FWHM < 1000km s~'), the sources were classi- 
fied (purely in optical spectroscopic terms) as Narrow Emission 
Line Galaxies (NELGs). Objects with galaxy-like optical spec- 
tra and no apparent emission lines were classified as absorption 
line galaxies (ALGs), some of which are associated to clusters. 
Table|2lists the number of identified sources, along with those 
which have good quality X-ray spectra (see below). 

Of all the identified sources, we have only analysed the 
spectra of BLAGNs, NELGs and ALGs. 

Fig. |2] shows the redshift distributions of these sources. 
BLAGNs are detected up to redshifts of ~ 3, while at the flux 
limit of our optical identifications we do not detect NELGs and 

4 http://venus.ifca.unican.es/~xray/AXIS/ 
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— All sources 

-■- Identified sources 




Log S, 



Fig. 1. Scs-iOkeV distribution for the whole sample (solid line) 
and the sub-sample of identified sources (dashed-dot line). 



— BLAGNs 
NELGs 

— ALGs 



2 

redshift 



Fig. 2. BLAGNs (solid line), NELGs (dashed line) and ALGs 
(dashed-dot line) redshift distributions. 



ALGs at redshifts above ~ 0.5. Similar distributions, at similar 
fluxes, have been found previously (e.g. Green et al. 2004]for 
the ChaMP survey). In spite of the incomplete identifications, 
we do expect the identified sources to constitute a "representa- 
tive" sample of the underlying population of sources at X-ray 
fluxes above ~ 2 x 10~ 14 erg cirT 2 s _1 , where the identified frac- 
tion of objects is much higher ( 50%). 



ground region was defined as an annulus centred at the source 
position, with inner radius equal to the source extraction ra- 
dius and outer radius three times the inner radius. If there was 
overlap with nearby sources, the sizes of source and/or back- 
ground regions were reduced to eliminate the overlap. If nearby 
sources were inside the background region their extraction re- 
gions were excluded from the background region. 

Source and background spectra were then extracted with 
the SAS task evselect. Single and double events were in- 
cluded for the pn detector and single-quadruple events for the 
MOS detectors in the energy band 0.2-12 keV. Further, only 
events with the highest spectral quality (FLAG=0) were included 
for the EPIC-pn data. 

In order to perform a proper spectral analysis, we created 
for each source the redistribution matrix file (RMF) and the an- 
cillary response file (ARF), using the SAS tasks rmfgen and 
arfgen respectively. To maximise the signal to noise ratio, we 
have combined MOS 1 and MOS2 source and background spec- 
tra and the corresponding response matrices. Merged source 
and background spectra were obtained by adding the individual 
spectra. Backscale values (size of the regions used to extract the 
spectra) and calibration matrices for the combined spectra were 
obtained weighting the input data with the exposure times. 

For the fields with more than one observation, we added all 
the MOS and pn data for each epoch but MOS and pn data were 
not merged because of their different responses 5 . 

In order to permit the x 1 minimisation technique in the fit- 
ting process, the raw spectra were binned such that each bin 
contained > 10 counts (source plus background). We then im- 
posed two quality thresholds on the grouped spectra selected 
for analysis: number of bins (MOS+pn) > 5, and number of 
background subtracted counts (MOS+pn) > 50. 

This filtering process resulted in a final sample of 1137 
sources having X-ray spectra of sufficient quality to analyse 
them individually (see Table |2]for the distribution of identified 
sources). 

Fig. |3 shows the quality of the selected spectra as a func- 
tion of the 0.2-12 keV number of counts for the whole sample 
of sources (solid-line histogram) and for the objects identified 
as BLAGNs and NELGs (shaded histogram). The vertical line 
indicates the quality filter that we applied to select the spectra 
appropriate for the analysis. In Fig. |4]we present, as an ex- 
ample, the X-ray spectra of some of the objects that we have 
analysed. 



3. Extraction of X-ray spectral products 

We have cleaned the calibrated event files with the same fil- 
tering expressions that were used by the pipeline to create the 
EPIC X-ray images. This filtering removes time intervals con- 
taminated by soft proton flares, cosmic ray tracks and spurious 
noise events not created by X-rays. For each object in the cata- 
logue the SAS task region was used to define the source and 
background extraction regions. 

The source region shape was defined as a circle centred on 
the source coordinates and with a typical radius of ~ 14 — 30", 
depending on the source position within the detector. The back- 



4. Spectral Analysis 

The data were fitted with the XSPEC (version 1 1 .2) software 
package. The spectral analysis has been carried out fitting si- 
multaneously MOS and pn spectra, forcing the models to have 
the same parameters in both instruments, including the normal- 
isation (we do not expect significant calibration mismatches be- 
tween instruments in the energy interval considered and for the 
quality level of our data). 

We quantified the results of the spectral fits in terms of 
the null hypothesis probability P(y 2 ) , i.e., the probability that 

5 see XMM-Newton handbook at http://xmm.vilspa.esa.es 
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Fig. 3. Distributions of background subtracted counts 
(MOS+pn) for the objects in our sample (solid histogram) 
and for the objects classified as BLAGNs or NELGs (shaded 
dot-dashed histogram). The vertical line indicates one of the 
quality thresholds applied to the spectra. For clarity, the last 
bin in the plots includes the spectra with more than 500 counts. 

the observed spectrum is derived from the parent model under 
scrutiny. We consider a model not good enough if P(x 2 ) is less 
than 5%. 

To test the significance of additional input parameters the 
F-test has been applied with an adopted significance threshold 
of 95%. As is shown in Protassov et al. 120021 if the condi- 
tions to use the F-test statistic are not satisfied, the false posi- 
tive rate might differ from the value expected for the selected 
confidence level. One of the conditions that must be satisfied, is 
that the null values of the additional parameters, should not be 
in the boundary of the set of possible values of the parameter. 
We are using the F-test to calculate the significance of detec- 
tion of absorption, and one of the possible values of the ab- 
sorbing column is zero. Hence, because one of the conditions 
for using the F-test is not satisfied, we do not know whether 
the fraction of spurious detections in our data differs from the 
5% expected. To calculate this number, we carried out simula- 
tions of unabsorbed spectra, and then we calculated the number 
of cases where we detected absorption with a confidence level 
above 95%. We have detected absorption in ~ 2% of the simu- 
lated spectra. As this fraction is not significantly different from 
the expected value, we decided to be conservative, and hence, 
we have continued with 5% as the number of spurious detec- 
tions, or false positive rates. 



4.1. Single power law (model A) 

We have started the analysis fitting a single power law to the 
background-subtracted spectra, hereafter model A. This model 
has two free parameters, the normalisation and the continuum 
slope T. To account for the effect of the galactic neutral hy- 
drogen absorption along the line of sight, a fixed photoelec- 
tric absorption component was included. The Galaxy H column 
density values for each field, N^ al , were extracted from the HI 
map of Dickey & Lockman ( 1990). For the selected acceptance 
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Fig. 4. From top to bottom unfolded spectra of 

a) XMMU J221454.9-173949, absorbed BLAGN; 

b) XMMU J084132.3+704757, unabsorbed NELG 
(L 2 -iokeV -eiO^ergs- 1 ); c) XMMU J33106.2+241325, 
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level, 139 (12%) objects had statistically unacceptable fits with 
this model, the expected number for the selected confidence 
level being 57. Within the sub-samples of identified sources, 
model A could not be accepted for 10 (7%) BLAGNs and 9 
(31%)NELGs. 

4.2. Single power law with excess absorption (model 
B) 

Synthesis models of the XRB, based on AGN unification 
schemes (Setti & Woltier [T9"g9l Madau, Ghisellini & Fabian 
[19941 Comastri et al. [T^95l Pom pilio, La Franca & Matt 2000 
Gilli, Risaliti & Salvati 1200 1 i predict a large population of 
highly absorbed AGNs. Hence we expect a significant fraction 
of our objects to exhibit absorption in excess of the Galactic 
value. An indication of absorption is to measure a flat spec- 
tral slope with model A. To study excess absorption, we fitted 
a model with two local absorbers, one with the column den- 
sity fixed at the Galactic value and the other one as a free 
parameter N° bs (model B). Note that this last term is essen- 
tially indistinguishable from an absorber with column density 
~ N° bs (1 + z) 2 7 (Barger et al l2UU51 Longair [T9^2l at the red- 
shift of the source, z, since absorption edges would not be de- 
tected in our highly grouped, moderate signal-to-noise spec- 
tra. This model has three free parameters, the normalisation, 
the continuum spectral slope T, and the equivalent absorbing 
column at z = 0, N° bs . The top panel in Fig. |5] compares the 
POf 2 ) distributions obtained with model A with the results from 
model B for the whole sample of sources. We see that the num- 
ber of unacceptable fits has been reduced by a factor of ~ 1.5, 
implying the presence of intrinsically absorbed objects. 

4.3. Single power law with intrinsic absorption (model 
C) 

As an alternative, more physical parameterisation of the ab- 
sorbing column of the BLAGN, NELG and ALG, we instead 
fitted photoelectric absorption at the redshift of the source 
(model C). The model has three free parameters, the normal- 
isation, the continuum spectral slope, T, and the rest frame ab- 
sorption, N^ tr . Fig. [5] compares the results of the fits obtained 
with model A and model C, for BLAGNs and NELGs. For a 
detailed description of the spectral properties of the 7 ALGs 
analysed, see Sec. [6] In the case of the BLAGNs, we did not 
find a substantial improvement in the overall quality of the fits, 
suggesting that the majority of the BLAGNs do not require in- 
trinsic absorption in excess of the Galactic value. Therefore, 
for the statistically unacceptable fits, more complicated models 
might be needed to reproduce the spectra of these sources. On 
the contrary, the observed significant improvement in the qual- 
ity of the NELG spectral fits obtained with model C, suggests 
that a large fraction of NELGs are intrinsically absorbed. 

4.4. Absorbed power law and soft excess (model D) 

The soft excess has been shown to be a common feature of the 
X-ray spectra of Seyfert 1 galaxies (Turner & Pounds 1989 1. 



Whole somple 



— model A 
model B 



20 



40 60 



— model A 

- - model C 



r 




40 60 

p(x 2 )m 



— model A 

- - model C 



40 60 



Fig. 5. Comparison of the null hypothesis probability distribu- 
tions, P(x 2 ), obtained from model A (simple power law, solid 
lines) and models B and C (observer frame absorbed power law 
and rest frame absorbed power law models, dashed lines). From 
top to bottom: whole sample of sources, BLAGNs and NELGs. 

The physical origin of this emission is still unclear, since its de- 
termination depends on a knowledge of the shape of the power 
law and the quantity of absorption. It has been interpreted as 
primary emission from the accretion disc, gravitational energy 
released by viscosity in the disc, or as secondary radiation from 
the reprocessing of hard X-rays in the surface layers of the disc. 
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We have searched for soft excess among the BLAGNs and 
NELGs. We first have fitted their spectra with a power law 
and a low energy black body component at the redshift of the 
source, absorbed by the Galaxy. The soft excess emission, pa- 
rameterised as a black body, adds to the model two parameters, 
the black body temperature and normalisation. 

We have compared the^- 2 of the fit with the values obtained 
with model A to search for the cases where the significance 
of a x 1 improvement was above 95%. From the 170 sources 
analysed (141 BLAGNs and 29 NELGs) we found evidence of 
soft excess emission in 12 sources (~7%), 10 BLAGNs and 2 
NELGs. 

As explained before, if soft excess is present in the spec- 
tra of the objects, it must be modelled properly to prevent the 
continuum spectral slope Y and/or the intrinsic absorption N^ tr 
being incorrectly determined. Therefore we have analysed the 
X-ray spectra of these 12 sources in more detail. All the spec- 
tra have a large enough number of bins, 35<bins<317, to use 
a model with five free parameters: a power law and a low 
energy black body component, both absorbed by the Galaxy 
(N° al ; fixed) and by absorption (N™ tr ; free) at the redshift of 
the sources (hereafter, model D). Results from this specific ex- 
ercise are presented in Sect. 15. 31 



5. Results of spectral fitting 

5.1. The continuum shape 

Although we know that a large fraction of sources in our list 
are absorbed, we have started the study with the results that 
we obtained with model A to allow comparison with spectral 
analyses of data with low signal to noise. This model, as was 
explained before, does not take into account the effects of ab- 
sorption or soft excesses. 

To calculate the objects average photon index, (F), we have 
used the standard formula for the weighted mean, 



<r> = J] p t x r, 



where the weight, P,, of each individual best fit value, F,, is a 
function of the error obtained from the fit, <x,, 



To calculate the uncertainty in (F) we have used the standard 
deviation (Bevington ll992> 



cr 2 = 



1 



(N-l) 



2 Pi x (T* - <r» 2 



that includes the measurement errors, <x,, and the dispersion of 
each T, from the estimated value (F). 

Following these definitions, we find our sources to have a 
continuum emission, from 0.2 to 12 keV, with an average slope 
of <r> = 1.86 + 0.02 (the arithmetic mean being 1.70 + 0.02). 
To study the dependence of the objects average spectrum on the 
X-ray flux, we have calculated the values of (T) in bins of 0.5- 
2 and 2-10 keV flux (each bin containing the same number of 
objects). Fig.[6]shows the results that we have obtained. We see 



that the average spectrum of the sources becomes harder with 
decreasing 0.5-2 keV flux, with (F) going from 2.01 + 0.03 at 
~ 4.4x 10" 14 erg cirT V to 0.95+0.12 at ~ 10~ 15 erg cirT 2 s _1 . 
This trend 6 (harder sources at fainter 0.5-2 keV fluxes) has 
been found in previous X-ray spectral analyses (e.g., Vikhlinin 
et al. H99"9l Mittaz et al. H99"9l Tozzi et al. EOTjTl Mainieri et al. 
2002 1. Its origin is still not clear, although it could be explained 
as a rapid increase in the intrinsic absorption of the objects with 
decreasing flux, as it is predicted by some AGN synthesis mod- 
els (see Gilli et al. l200Tl 

It is not possible to see the same effect using the 2-10 keV 
fluxes, where F seems to become softer at fainter fluxes (see 
Fig. |6j. It is important to note that this softening of F is not 
a real property of our objects. It comes from using differ- 
ent energy bands simultaneously in the source detection. As 
we go to fainter 2-10 keV fluxes, it is more difficult to de- 
tect sources with flat spectral slopes (hard objects) because 
their spectra peak at energies above ~ 2 keV where we know 
the sensitivity of the X-ray detectors decreases rapidly. Hence, 
the number of objects only detected in the soft band becomes 
more important at fainter 2-10 keV fluxes, specially below 
~ 2 x 10~ I4 ergcm~ 2 s _I . Note that the last bin at the faintest 
fluxes (see Fig.|6j is located at a value of (r) of more than 2.5. 
The objects that contribute to this bin have spectra with very 
few counts and therefore the values of F are poorly constrained. 

We have studied in more detail whether the observed hard- 
ening of F with decreasing soft flux could be affected by any 
bias introduced during our analysis. We first have checked that 
the same result is obtained when only sources with more than 
150 counts (circles in Fig. [6) are used. Therefore, the sources 
with poor spectral quality are not introducing any bias in our 
result. 

We expect the majority of the observations that we have 
used, not to be deep enough as to detect sources with a 0.5- 
2 keV flux of ~ 10~ 15 erg cirT 2 s _1 in the soft band. Hence, it 
could be possible that our (F) becomes harder with decreas- 
ing flux partly due to the contribution to each bin from bright 
hard sources not detected in the soft band. This can be checked 
by repeating the plot of Fig. [6] with those sources selected in 
the soft or hard bands (detection likelihoods above 10 on each 
band). The results, plotted in Fig. [7] show a very similar de- 
pendence of F with flux. Only the (F) of the bin at the faintest 
soft fluxes in Fig. [6] could be affected or even dominated by 
the contribution of bright hard sources not detected in the soft 
band. It is interesting to note that the hardening of F is more im- 
portant for objects detected in the 2-10 keV band (see Fig.0. 
If absorption is producing the hardening of F, this result sug- 
gests that we are seeing the same population of objects in the 
soft and hard bands, the only difference being that the objects 
detected in the 2-10 keV band are more absorbed on average. 

Finally we have used the source detection efficiency func- 
tion, W(r, S) (see Appendix [A), to correct our results for all the 
biases of our study. The de-biased average values of F at each 



6 The same dependence, i.e., hardening of (T) at fainter 0.5-2 keV 
fluxes, is observed if (T) is calculated with the arithmetic mean, there- 
fore the result is not an artifact of the weighted mean. 
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Fig. 6. Weighted photon index versus So.s-ihiW and 52-iokeV from model A (simple power law). The results obtained for the 
objects with X-ray spectra with more than 150 counts (background subtracted) are included for comparison. Error bars correspond 
to 90% confidence 
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Fig. 7. Weighted photon index versus S 0.5-2 kev from model A 
(simple power law) for sources with detection likelihood >10 
in the 0.5-2 keV band (squares) and for sources with detection 
likelihood >10 in the 2-10 keV band (circles). Error bars cor- 
respond to 90% confidence 

bin of flux were obtained weighting each individual value, T, 
with the function: 

= 1 /(trf x WjY^Sd) 

where the sum is calculated with all the sources falling in the 
specified bin in the F, S plane (for a detailed explanation see 
Appendix lAl. 

The results are shown in Fig. [8] the left plot for the whole 
sample of sources, and the right plot for the sources detected 
in the soft band. For comparison we have calculated the (r> 
values weighted with the fitting errors in the same flux bins 
(stars in the plots). We still see the hardening of (F) with the 
0.5-2 keV flux, so this dependence is indeed a real property of 
the population of the sources analysed. 



5.2. Excess Absorption 

The fit to the X-ray spectra with a single power law (model A) 
yields a large number of sources with statistically unacceptable 
fits. Models B and C reduce significantly the number of unac- 
ceptable fits as it is shown in Fig.|5] suggesting that a substan- 
tial fraction of our sources could be X-ray absorbed, in agree- 
ment with the predictions of the standard synthesis models of 
the XRB (Comastri et al. [T995l Gilli et al l20l)Tl 

To search for absorbed objects we used the F-test, measur- 
ing the improvement in the;f 2 of the fits after adding absorption 
to the model. A 95% confidence cutoff is adopted for absorp- 
tion to be detected. In Tabled we show the total number of ob- 
jects of different types, the number of absorbed objects of that 
type, and the fraction of absorbed objects of each type with its 
uncertainty and significance. 

For a sample of N objects, of which n are absorbed, the 
Bayesian posterior probability of the fraction of absorbed ob- 
jects is P(f) oc (f+f spu )"(l-f-f spu )"-" (from the binomial dis- 
tribution), where f spu is the fraction of spurious detections (for 
our sources 5%). We used the mode of P(f) to estimate the frac- 
tion (corrected from spurious detections) of absorbed objects 
within each class of sources. The errors in the fractions were 
calculated integrating P(f) from the mode value in the two di- 
rections until we obtained 45% of the probability (90% errors). 
For every "class" of sources, we measured the confidence of 
detection of absorption integrating the P(f) distributions from 
f=l down to the value where the integral is equal to 0.9973 (or 
3cr). These values, listed in the last column of Table[3] indicate 
the fraction of absorbed objects that we have detected within 
each class of sources with a significance of more than 3<x. For 
example, we know with a confidence of more than 3<x that at 
least ~ 1% of BLAGNs and ~ 17% of NELGs are absorbed. 

It appears that the fraction of absorbed objects varies sub- 
stantially among different classes of object, with the NELGs 
having the highest fraction. We have used the method de- 
scribed in Stevens et al. 2005 to study whether our samples 
of BLAGNs and NELGs have different fractions of absorbed 
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Fig. 8. Comparison of the weighted photon index versus S 0.5-2 kev from model A (simple power law) with the values obtained 
after correcting for the source list biases. Left for the whole sample of objects and right for the sub-sample of sources with 
detection likelihood >10 in the 0.5-2 keV band. Error bars correspond to 90% confidence 



objects. Assuming that BLAGNs and NELGs come from the 
same parent population, the fraction of absorbed objects is 

P(f) oc {f + f spa ) (n+m \l - / - fs pu ) (N+M -"- m) 

where M and N are the number of sources in the two samples 
and m and n the number of detections on each sample. 

The probability of detection of i objects in the first sample 
and j in the second one is given by 



P(i,j;N,M) = 



M\N 



x (1-/-/. 



dfP(f)(f + f spu ) {i+j) 



yN+M-i-j) 



spa 

The fractions of absorbed objects in BLAGNs and NELGs 
will be different if the probability of detecting < m in the 
first sample and > n in the second one is low, i.e. if P(> 
n,< m; N, M) = YJf=olI?=„P(i, j,N,M) is low. We find that 
P(m < 16,« > 13; 141,29) = 0.000013. Therefore we see 
that the fraction of absorbed objects in BLAGNs and NELGs 
is different with a significance of more than 3<x. We cannot dis- 
tinguish the fraction of absorbed objects among NELGs and 
ALGs (the probability of them being different is not signifi- 
cant). 

Fig. |9] shows the intrinsic Nj^ tr distributions measured for 
the absorbed BLAGNs and NELGs. BLAGNs show column 
densities 4.5 x 10 20 < NJf" < 2.3 x 10 22 cirT 2 while NELGs 
seem to cover a much wider range of absorption, with 1 . 1 x 
10 21 < N™ tr < 1.7 x 10 23 cirT 2 . However, the small number 
of objects does not allow us to find a significantly different (in 
terms of the Kolmogorov-Smirnov test) distribution of absorb- 
ing columns between BLAGNs and NELGs. 

To be confident that the detected intrinsic absorptions 
among identified sources were not affected by the quality of 
their spectra, we have plotted T versus NJ^ tr to search for any 
correlation between the two spectral parameters. Such a corre- 
lation would be expected if the detections of absorption were 
spurious and linked to very large values of T. The results are 
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Fig. 9. Distributions of intrinsic absorption for BLAGNs (solid 
histogram) and NELGs (dashed histogram). 




Fig. 10. Photon index versus intrinsic absorption for absorbed 
(F-test significance > 95%) BLAGNs (triangles) and NELGs 
(diamonds). 
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Table 3. Fractions of F-test absorbed objects and significances. 



Type 


N„ 


N ;lbs 


/- +a 




All 


1137 


245 


17 +um 

1 ' -0.02 


0.1335 


BLAGNs 


141 


16 


06 +0 05 


0.0089 


NELGs 


29 


13 


40 +015 


0.1740 


ALG 


7 


3 


4 +o.20 

-0.20 


0.0400 



a The f values are the mode of the function P(f) (see Sec. I5.2l for 

details). The errors are 90% 
b F is where P(f> F)=0.9973 (3cr) 



shown in Fig. ^3 The plot shows no apparent correlation be- 
tween T and Ny tr . 

A Kolmogorov-Smirnov comparison between the redshift 
distributions of unabsorbed and absorbed BLAGNs does not 
indicate a significant difference between the two. The same re- 
sult is obtained when comparing the 2-10 keV luminosity dis- 
tributions of absorbed and unabsorbed BLAGNs and NELGs 
(we use the 2-10 keV energy band because it is less affected by 
absorption). We conclude that X-ray absorption does not occur 
preferentially at any particular redshift (in BLAGNs) or X-ray 
luminosity. 

Finally, we have tested whether the lack of optical broad 
emission lines in unabsorbed NELGs can be explained in terms 
of AGN/host galaxy contrast (Lumsden et al. 120011 Moran et 
al. UHU7\ Page et al. |2"U0H Severgnini et al. |2"U0H . To this 
end, we have compared the 2-10 keV luminosity distributions 
of two sub-samples of unabsorbed BLAGNs and NELGs de- 
tected with redshifts ranging from 0.17 to 0.7, where the red- 
shift distributions of BLAGNs and NELGs overlap (see Fig.|2}- 
The Kolmogorov-Smirnov significance of the two distributions 
being different is ~ 99%. Furthermore, the average luminosi- 
ties of the selected BLAGNs and NELGs were found to be 
43.7+0.16 and 43.2+0.11 (in log units). Therefore, we have 
seen that our unabsorbed NELGs are on average significantly 
less luminous than unabsorbed BLAGNs at similar redshifts, 
and hence, our data are consistent with the explanation of non 
detection of broad optical lines in unabsorbed NELGs as an 
AGN/ host-galaxy contrast effect. 

5.3. Soft excess 

We have previously shown that ~7% of the sources identified 
as BLAGNs or NELGs exhibit significant soft excess emis- 
sion. To measure the broad band spectral parameters of these 
sources, and in particular the underlying power law index F we 
have used model D, as described in detail in Sec.|4] 

The results of the fits are listed in Table |4] where for each 
source we have included its IAU-style XMM-Newton source 
name, the optical classification, redshift, and the spectral pa- 
rameters measured with model D. We have also included the 
soft excess contributions to the flux and luminosity in the soft 
band and the F-test results obtained from the comparison of 
X 2 from model A to x 2 from model C and model D. We now 
discuss some particular sources in detail: 



XMMU J001831.7+162924, XMMU J222823.6-051306, 
XMMU J212904.5-154448: we obtained an improvement in 
the spectral fits with model C, but the F-test significance was 
in all cases below 95%, therefore the sources were classified 
as unabsorbed. A proper modelling of the soft excess emission 
with model D allowed us to find significant absorption above 
the Galactic value in all the sources. 

XMMU J221515.1-173224: we obtained an unusually 
steep spectral slope with model A, F = 2.67 tn'ofi' ^ e quality of 
the fit did not improve significantly with model C. With model 
D we obtained a significant improvement in the fit (> 99.99%). 
This model did not require absorption in excess of the Galactic 
value . The measured temperature of the soft excess emission 
was kT=0.05^QQj, below the typical values found for the rest of 
the sources. The fitted continuum was still significantly steep, 
with F = 2.60 .We tried to find a more typical value of F fit- 
ting the continuum slope at high energies (> 2 keV) with model 
A. We obtained a value of F significantly lower, 2.O6+03Z and 
in agreement with the average spectral slopes observed in un- 
absorbed AGN. With F fixed to the above value the fit did not 
improve with model C, but with model D we still obtained an 
F-test significance of > 99.99%. The model did not require ab- 
sorption in excess of the Galactic value, therefore we believe 
the improvement is due entirely to the soft excess component. 
We adopted this model as the best fit model for the source al- 
though the x 2 °f the fit was slightly worse that the value ob- 
tained with F fully variable. 

XMMU J061730.9+705955: we found a significant col- 
umn density with model C, hence the source was classified 
as absorbed. However we obtained a very steep spectral slope, 
F = 2.87+o^j. With model D we obtained a better fit of the data 
and more typical values for the spectral parameters, although 
model D did not require intrinsic absorption. We have chosen 
model D in preference to model C for this source. 

It is interesting to note that for some of the sources that we 
have analysed, we found very flat spectral slopes with model 
A, but they all required significant quantities of soft excess 
emission. This is the case for XMMU J12226.1+752616 and 
XMMU J222823.6-051306, with measured model A spectral 
slopes of r = 1 .52+°;*} and F = 1.78^ [[ respectively. 

5.4. Results from best fit model 

To test whether the spectral hardening observed using model 
A is due to an increase in intrinsic absorption at the faintest 
fluxes, we compared F (from the best fit model in terms of the 
F-test results) to the soft X-ray flux for each source. The best fit 
model can be model A or model B for the unidentified sources, 
and model A or model C for the identified sources (or model 
D for BLAGNs and NELGs when soft excess emission was 
detected). Using the best fit model for each object we obtained 
a value of (F) = 1.96 + 0.01 (the unweighted value was 2.08 + 
0.04). The dependence of (F) with the soft flux is plotted in 
Fig-in where we see that: 

1 . The average continuum of absorbed sources is significantly 
softer than the continuum of unabsorbed sources, especially 
at the faintest fluxes, with (F) = 2. 15±0.04(the unweighted 
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Table 4. Spectral parameters of sources with a soft excess component 
detected 



Name a 


Class redshift 


r 


Njf b kT! 


So.5-2 


L05-2 model C c model D d 








(10 20 cnr 2 ) (keV) 


(%) 


(%) 



XMMU J001831.7 + 162924 BLAGN 0.551 2.35+° 4.15 + .> 1 3 25 0.186^ 15.95 23.39 

XMMUJ021808.2 - 045844 BLAGN 0.712 2.08^ < 10 20 0.142+° [^ 6.55 19.49 

XMMU J021830.5 -045622 BLAGN 1.430 2.28!° : ]£ < 10 20 0.129^ 2 g 1.89 30.08 

XMMUJ061728.6 + 710600 BLAGN 0.219 1.68^ < 10 20 0.214^ 39.54 44.57 

XMMU J061730.9 + 705955 NELG 0.200 < 10 20 0.207+j} 272 44.14 49.31 

XMMU J0743 11.9 + 742935 BLAGN 0.332 2.05!^ < 10 20 0.119+° : [^ 12.22 22.12 



BLAGN 


0.551 


2.35 


BLAGN 


0.712 


2.08 


BLAGN 


1.430 


2.28 


BLAGN 


0.219 


1.68 


NELG 


0.200 


2.02 


BLAGN 


0.332 


2.05 


NELG 


0.238 


1.82 


BLAGN 


0.174 


1.83 


BLAGN 


0.431 


1.29 


BLAGN 


1.165 


2.06 


BLAGN 


0.956 


2.15 


BLAGN 


0.758 


1.52 



XMMU J12226.1 +752616 NELG 0.238 1.82+° ]^ 11.17+||g 0.146+°™ 12.30 22.72 

XMMUJ13316.2 + 241326 BLAGN 0.174 1.83^ < 10 2 ° 0.085^ 14.71 25.20 

XMMU J21 2904.5 - 154448 BLAGN 0.431 1.29^ 14.21+j 2 8 f 0.205^ 54.55 72.74 

XMMUJ221515.1 - 173224 BLAGN 1.165 2.06+° : ^ < 10 2b 0.212+^ 24.75 49.69 

XMMUJ221523.6- 174321 BLAGN 0.956 2.15+°^ < 10 20 0.112+jj 2 ^ 1.86 19.16 

XMMUJ222823.6- 051306 BLAGN 0.758 1.52+°;Jl! 15.6Uf 2 f 0.262+°;™ 35.78 54.55 

XMM-Newton source name 
Intrinsic absorption measured with model D 

Detection of intrinsic absorption with model C (F-test comparison of the^- 2 
from model A and model C) 

Detection of intrinsic absorption with model D (F-test comparison of the;f 2 
from model A and model D) 



Not 


abs. 


abs. 


Not 


abs. 


Not abs. 


Not 


abs. 


Not abs. 


Not 


abs. 


Not abs. 


abs. 


Not abs. 


Not 


abs. 


Not abs. 


abs. 


abs. 


Not 


abs. 


Not abs. 


Not 


abs. 


abs. 


Not 


abs. 


Not abs. 


Not 


abs. 


Not abs. 


Not 


abs. 


abs. 



_ ■ Whole sample 
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"T X-ray unabsorbffd srcs 
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Fig. 11. (r) versus So.5-2keV obtained from the best fit model of Fig. 12. T versus So.5-2keV f° r objects with flat spectrum (F 

each object. Squares denote the whole sample; stars absorbed positive error bar <1.5). We have grouped the objects in terms 

sources (F-test> 95.0%) and triangles unabsorbed sources (F- of the number of bins in their pn spectrum. 
test< 95.0%). 



value being 3.12+0.12 7 ) for the absorbed sources and (r) = 
1.94 + 0.01 (the unweighted value being 1.79 + 0.02) for the 
unabsorbed sources. 
2. There is no hardening of the average spectral slope for the 
absorbed sources. The hardening of T seen in these objects 
when fitted with the SPL model is likely to be an effect of 
absorption only. 

7 The unweighted mean is very high and different from the value 
that we obtained for the weighted mean, because there are a number 
of absorbed objects for which T > 2.5 but with large errors bars. 



3. For the sources where absorption was not detected in terms 
of the F-test, we still observe a significant hardening of the 
average spectral slope with the X-ray flux. 

Since the sources where absorption is detected do not ex- 
hibit any significant variation of the underlying spectral index 
with flux, does this hold true for the remaining sources? We first 
wanted to check whether any effect could lead to an apparent 
flattening of the spectral shape towards fainter fluxes when ab- 
sorption is included in the fitting model. To this goal, we have 
generated a number of fake spectra with an underlying power 
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law F = 2 at various fluxes spanning those of our sample, and 
different amounts of absorption at z — 0. 

We have then first fitted these spectra with models A and B, 
and then we have compared the^ 2 of the fits with the F-test. We 
found a range of low values of N° bs where absorption was not 
significantly detected when model A was replaced by model B 
(F-test < 95%). In these cases, the best fit value of F ( i.e., the 
value obtained with model A) was indeed lower than the input 
one, and it can reach values in the range F ~l-2. 

A second result of this exercise is, if we hold the absorption 
in the simulated spectra but vary the flux, at fainter fluxes the 
typical values of F fitted are marginally harder than at higher 
fluxes (and then fitted values of Nh tend to be lower than the 
input ones). We believe this effect to be due to a degradation of 
the spectral resolution of the X-ray spectrum, produced by the 
necessary grouping in very wide bands to achieve the desired 
statistics (10 counts/bin). Fhis leads, in particular, to a smooth- 
ing of the model count rate function (which is the effective area 
times the input spectrum, convolved with the redistribution ma- 
trix) in the whole spectral range, but in particular in the vicin- 
ity of its peak at soft energies where most of the counts are 
detected. Fhis produces an undesired slight hardening of the 
spectrum. 

However, this effect is unlikely to produce the hardening 
of the unabsorbed sources that we see. The reason is that in 
this case we should see the lowest quality spectra (i.e., those 
with the smaller number of bins) being more prone to this ef- 
fect, but this is not the case. Fig.ll2lshows the sources that we 
detect in the real sample with flat spectra (best fit F error bar 
upper limit below 1 .5), grouped in terms of the number of bins 
in their EPIC-pn spectra. We clearly see that the hardening is 
not correlated with the quality of the spectra, and therefore the 
above effect does not dominate. 

We conclude that the hardening towards fainter fluxes is 
real. This still leaves two options for its origin: intrinsic flat- 
tening of the power law or undetected low level absorption. 
Although the current data do not allow us to distinguish be- 
tween them, there is nothing inconsistent with the simplest hy- 
pothesis which is that these sources are absorbed at a level 
which is undetectable in the current data.This is also supported 
by the results of Fig.[6](right plot) where we do not see harden- 
ing of T with the 2- 1 ke V flux down to ~ 2x 1 (T 14 erg crrT 2 s~ 1 . 
If absorption is producing the hardening effect that we see in 
our objects, then we do not expect to see the same effect us- 
ing 2-10 keV fluxes, because they are less affected by absorp- 
tion. However, deeper observations are needed to assess this 
point for sources at fluxes similar to those of the sources in our 
sample. The results of a detailed X-ray spectral analysis of the 
sources that were detected with XMM-Newton in a deep obser- 
vation in the Lockman Hole field will be published in Mateos 
et al.|2005 (in preparation). 

The average spectral indices for the best fit model in the 
sub-samples of BLAGNs and NELGs are 1.99 ± 0.02 and 
2.35 + 0.09, both in agreement with the canonical value mea- 
sured for nearby unabsorbed BLAGNs (Nandra & Pounds 
I1994 l. Both sub-samples of sources exhibit the same depen- 
dence with the soft X-ray flux as shown in Fig. ^] We still 
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Fig. 13. (r) from best fit model versus S 0.5-2 kev for BLAGNs 
(triangles) and NELGs (circles). The hardening of the spectral 
slope is evident even among the sub-samples of AGNs. Error 
bars correspond to 90% confidence level. 

detect a clear correlation between the spectral slope and the 
soft X-ray flux. 

5.5. Photon index intrinsic dispersion 

In all the above analysis we have observed a clear dispersion in 
the measured spectral slope of our objects, even within the sub- 
samples of BLAGNs and NELGs, with the largest scatter in T 
being observed for the NELGs. To test whether the observed 
scattering is intrinsic to the sources we have followed the pro- 
cedure described in Nandra & Pounds ( 1994 > and Maccacaro 
et al. (1988 1, where the distribution of source spectral indices 
is assumed to be well reproduced by a Gaussian distribution 
of mean (T) and dispersion cr^. The best estimates of (F) 
and cr^r) are obtained with the maximum likelihood (ML) tech- 
nique. We have calculated the distribution of the spectral slopes 
that we obtained from the best fit model for the whole sample 
and the sub-samples of BLAGNs and NELGs. The results are 
shown in Table |5] In all the classes we measured a significant 
dispersion in spectral slopes. The large error in F obtained for 
NELGs did not allow us to differentiate the average spectral in- 
Table 5. Comparison of the weighted mean spectral photon in- 
dex obtained for different types of sources with the values from 
the Maximum Likelihood analysis. The best fit spectral slopes 
for each object were used in the calculations. 





Maximum 


Weij 


|hted 




Likelihood 


Mean 


Sample 


<r> 


a 


<r> 


Whole sample 


1 of. +0.02 
L 86 -0.03 


36 +0 01 

U -" J -0.04 


1.96: 


+ 0.01 


BLAGN 


1 98 +004 

Ly0 -0.04 


21 +0 05 

-0.04 


1.99: 


t0.02 


NELG 


1 oc +0.16 
1.83 _ 017 


36 +014 


2.35: 


t0.09 
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Fig. 14. Contour diagrams for the average spectral slope and 
intrinsic dispersion of our objects obtained from the Maximum 
Likelihood analysis (see Sec. l5.5> . The spectral slopes obtained 
from the best fit model were used in the calculations. Solid line 
for BLAGNs and dashed line for NELGs. The contours are de- 
fined as Ay 2 =2.3, 6.17 and 1 1.8 corresponding to standard 1,2 
and 3<x confidence regions for two parameters. 



dex of NELGs from that of BLAGNs. We see that the mean 
values of T computed with the ML and the weighted mean 
are very different, in particular for the NELGs. The reason is 
that with the ML method objects with untypical F (i.e. outliers) 
and not very large errors will increase the dispersion of the fit- 
ted gaussian distribution rather than significantly affecting the 
mean. 

Our results do not show any trend in the underlying power 
law of the NELGs to be harder than that of the BLAGNs even 
at the faintest fluxes. Previous results based on ROSAT data 
(see Almaini et al 119961 Romero-Colmenero et al. 119961 do 
find a flatter average spectral slope for NELGs (~ 1.5) than 
for BLAGN (~ 2). Our data shows that the spectral differences 
between these two classes of sources are due mostly (if not 
totally) to differing amounts of absorption rather than to dif- 
ferences in the underlying power law spectra. Therefore many 
objects classified as NELGs may contain active nuclei. 

Fig. El shows the (T) versus o-<r> contour diagrams for 
the sub-samples of BLAGNs (solid lines) and NELGs (dashed 
lines), clearly showing the existence of an intrinsic dispersion 
in T with a confidence of more than 3cr. 

5.6. Spectral cosmic evolution 

No correlation was found between F and the rest-frame 0.5- 
10 keV luminosities, as shown in Fig. [21 For the sub-sample 
of BLAGNs we studied the measured T that we obtained from 
each best fit model as a function of the redshift, but we did not 
find any clear tendency between the two parameters as can be 
seen in Fig.[T6l 

We have studied the redshift distribution of the mea- 
sured rest frame absorbing column densities for the absorbed 
sub-samples of BLAGNs and NELGs. The results, shown in 
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Fig. 15. Weighted (T) versus Lo.s-iokev for BLAGNs (triangles) 
and NELGs (circles). Error bars correspond to 90% confidence 
level. 




Fig. 16. (r> versus redshift for BLAGNs. We have used the val- 
ues of T from the best fit model for each object (i.e. with ab- 
sorption and soft excess if required). Error bars correspond to 
90% confidence level. 

Fig. ^] show no evidence of evolution of Nj^' 1 with redshift. 
However, the ranges of the 2-D parameter space that we fully 
sample with our source list are unclear. 

We expect the minimum detected absorption to increase 
with z. This can be easily explained in terms of absorption be- 
ing redshifted towards lower energies and further decreased by 
a factor (1 + z) 2 ' 1 as we move upwards in z. 

To study if there exists a maximum detected column (only 
the brightest sources being actually detectable when heavy ab- 
sorption is present), and if this limit depends on redshift, we 
have carried out simulations as follows. A 3-D parameter space 
was defined with the axes being redshift, luminosity and intrin- 
sic (rest-frame) absorption. For each parameter we have de- 
fined a grid of values covering the observational limits estab- 
lished by our sources. First, for each point in the N^-z plane 
we simulate a spectrum for each BLAGN or NELG source in 
our sample, using its spectral response matrices and exposure 
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Fig. 17. Intrinsic absorption as a function of redshift for ab- 
sorbed (F-test significance >95%) BLAGNs (triangles) and 
NELGs (circles). 



The inferred high X-ray luminosities obtained from the 
analysis of their X-ray spectra strongly suggests that an AGN 
is the source of the X-ray emission in at least 5 out of the 7 
objects. The peculiar properties of these objects have been ex- 
plained by a very high column density that would be obscuring 
the AGN broad and narrow line regions. This hypothesis has 
found observational support, because many of the ALGs stud- 
ied have been found to be X-ray obscured (Mainieri et al. 2002 
Comastri et al. I2002i . However there are several examples of 
ALGs for which the obscuration model does not apply. In these 
cases other possible explanations have been suggested, e.g. that 
a BL Lac could be present in the core of some ALGs (Brusa et 
al. 2003 1. Alternatively some objects might host an AGN with 
a non-standard accretion disk (Yuan & Narayan 2004 1. A final 
possibility is that some ALGs may have normal (unabsorbed) 
AGNs that could be hidden if the host galaxy is bright enough 
(Severgnini et al. I2UU31 Page et al. l2"U0H . 

We now describe the X-ray spectral properties of the ALGs 
found in our list: 



time and assuming F — 2. As explained in Sec. [5] we have 
only analysed those spectra with more than 50 counts (back- 
ground subtracted). We use this limit to calculate the required 
minimum luminosity for each source to fulfil this constraint. 
Luminosities lower than this one would produce a spectrum 
outside our quality bounds. 

Second, for a number of luminosities above this limit (still, 
for each value of intrinsic absorbing column and z) we simu- 
late a spectrum for every source. These spectra were analysed 
following the same procedure that we used for the real spectra 
giving a full coverage of the above 3-D parameter space. 

We then examined the fitted values of N™ tr as a function of 
redshift z for a range of luminosities. The simulations confirm 
that the minimum detection of absorption depends strongly on 
z for the reason explained above. We also found two limits to 
the maximum absorption detected as a function z- The first one 
is the source detection itself: heavy absorption reduces signifi- 
cantly the flux of the source and makes detection very difficult, 
especially at high redshifts. The second limit is the detection 
of absorption: even if the source is detected, if its luminosity 
is not high enough we will measure an absorption below the 
maximum value established by the detection limit. 

This strong dependence on luminosity and redshift makes 
it very difficult to derive the real absorbing column distribution 
in terms of the fitted values. The best way we found to tackle 
this problem is to compare our results with the predictions of 
specific models for the distribution of absorbing columns, fold- 
ing the input values through the same effects as the real data. 
This is discussed in Sect. [7]in the framework of unified AGN 
models for the XRB. 



6. ALGs X-ray spectra 

The objects with galaxy-like optical spectra not showing emis- 
sion line signatures were classified as absorption line galaxies 
(ALG). 



XMMU J122017.7+752216 is an optically normal galaxy 
that looks extended in the X-ray image. Its soft X-ray emis- 
sion is best reproduced with a thermal model with temperature 
AT = 0.64+5}!$ keV and metallicity Z = Zo. The fit does not im- 
prove leaving Z as a free parameter. However, the high energy 
X-ray spectrum is best modelled with an unabsorbed power law 
of spectral slope F — 1 .87*2 }^. We cannot confirm whether the 
source of the hard X-ray emission is an AGN due to the low 
value of the hard X-ray luminosity inferred from this model 
(L^c^.lxlO^ergs- 1 ). 

The nature of XMMU J021822.1-050612 was studied in 
detail by Severgnini et al. (|2003 1. They found the X-ray spec- 
trum to be best modelled with an absorbed power law. 

The X-ray spectra of XMMU J010320.2-064157 
and XMMU J084211.6+710145 were best reproduced 
with an obscured (F-test values 99.7% and 95.0%) 
AGN with r = 2.39^|| N H = 4.9+Jjij x 10 22 cnT 2 and 
L 2 -io = 2 x 10 44 erg s _1 for the first object, and F = 1.82+°}!?, 
N H = 9.3+|;| x 10 20 cnr 2 and L 2 _io = 3.8 
the second object. 



x 10 43 ergs 1 



for 



XMMU J214041.3-234719has an unabsorbed X-ray spec- 
trum with T = 2.08+°;°jj and L 2 -io = 6.2 x 10 40 erg s _1 . Hence, 
from its X-ray spectrum alone, we would expect this source to 
be a BLAGN optically, rather that an ALG, according to the 
simplest AGN unified model. 

A simple power law model can reproduce the spectra 
of XMMU J233113.6+20056 (see Fig. Q2) and XMMU 
J122312.6+75247, but the best fit spectral slopes were found 
to be well below the typical value found for unabsorbed AGNs 
(T = 0.94+° ^ and T = 1.36+°|°). In terms of the F-test none of 
the spectra required absorption in excess of the Galactic value, 
although with the absorbed model we obtained more typical 
spectral slopes. The low quality of the spectra has not allowed 
us to study in more detail whether the spectra of these sources 
is intrinsically flat or absorbed. 
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Fig. 18. Top panel: MOS+pn spectrum of the absorption line 
galaxy XMMU J233113.6+20056 at z=0.378. The X-ray 
spectrum was best fitted with a flat (T ~ 0.944) unabsorbed 
power law. Bottom panel: optical spectrum of the source, where 
no obvious broad lines were detected 

7. Testing population synthesis models 

The standard population synthesis models of the XRB, based 
on unification schemes for AGN, can reproduce successfully 
the XRB with an appropriate mixture of unobscured and ob- 
scured AGN, as well as a number of observational constraints. 
The input parameters of these models are the type 1 and type 
2 measured X-ray spectra and cosmological evolution, which 
is assumed to be identical. The X-ray luminosity function of 
absorbed objects is still unknown, but it is assumed to be the 
same as for unabsorbed objects. The ratio of absorbed to un- 
absorbed AGN comes out to be rather large in these models. A 
key descriptor of these models is the distribution of absorbing 
column densities among the sources. The N™ tr distribution is 
now observationally known for the local Seyfert galaxies (e.g., 
Risaliti et al. 1999 1, and in general these models assume it to 
be independent of redshift and luminosity. 

The analysis that we have conducted has allowed us to mea- 
sure the distribution of absorption in one of the largest samples 
of BLAGNs and NELGs studied up to now, and its cosmic evo- 



lution over a wide range of redshifts (see Fig. |5] and Fig. I17i . 
In Sec. l5.6l we reported that the measured absorption is slightly 
lower than the real one, depending on the luminosity and red- 
shift of the source. Hence, in order to compare the predictions 
of XRB models to our data, we assume an input distribution for 
N™" (independent of luminosity and redshift), and then fold it 
through the same detection effects as in our data. 

We have used the N™ tr distributions adopted by Comastri 
et al. fT995l Gilli et al. l2UoT1 and Pompilio, La Franca & Matt 
2000 as input parameters to their analytical models of the 
XRB. We have simulated spectra for each one of the BLAGNs 
and NELGs included in our list, by fixing their redshifts, ex- 
posure times, position and luminosities. The slope of the con- 
tinuum was assumed to be F = 2 for all the sources. For each 
source we extracted NS tt values from the specific model distri- 
bution, until the simulated spectrum fulfilled the quality filters 
that we used for the real spectra (binned spectra with at least 5 
bins and source net counts above 50). Indeed, heavily obscured 
sources do not pass, just as happens for real sources. We fitted 
the spectra with model A and model C and computed F-test 
significances for absorption, using the same calibration matri- 
ces. 

Fig \19\ compares the values of Nj'J' 1 ' measured in the real 
data and in the simulations for one of the XRB models (Gilli 
et al. 2001 1. Obviously, the adopted distribution is not able to 
reproduce the observational results. Very similar graphs are ob- 
tained for the models of Comastri et al 1995 and Pompilio et 
al. 2000 For all three models there is a clear excess of sources 
with intrinsic absorptions N™ tr > 10 22 cirT 2 , which are not 
present in the real data. For an F-test confidence level of > 95% 
we detect, in our real sample, absorption in 29 sources out of 
179. The number of absorbed sources detected in the simulated 
data sets were 79, 67 and 71 for the Comastri et al. 1 19951 Gilli 
et al. 2001 and Pompilio et al. 12000 distributions, in all cases a 
factor of more than 2.5 over the observed value. This result is 
also evident from Fig[l9](right panel), where the predicted dis- 
tribution of absorbing columns with the Gilli et al. 2001 model 
is plotted against redshift. We find excess of absorbed objects 
all over the redshift interval up to redshifts of ~2. Discrepancies 
between the predictions from synthesis models and what it is 
obtained from observations have been observed previously (i.e. 
Piconcelli et al. 120021 Caccianiga et al. 2004 Georgantopoulos 
et al. l2"U04l Perola et al. l2U04l 

In the recent "modified unified scheme" of Ueda et al. 2003 
the AGN absorbing column density distribution depends on lu- 
minosity (not redshift dependence). The fraction of absorbed 
AGNs (Nh > 10 22 crrT 2 ) decreases significantly with luminos- 
ity. This model will reduce the observed discrepancies funda- 
mentally at high luminosities, and therefore at high redshifts. 
However, this model predicts a significant fraction of objects 
with Nh > 10 22 cirT 2 at the typical luminosities of our sample 
that we do not detect. The Ueda et al. 2003 model, although 
more in line with our results, still overpredicts the number of 
absorbed objects (Nh > 10 22 crrT 2 ) especially at low redshifts. 

One possible caveat is that some of these absorbed sources 
that we do not detect in the real spectra might be hiding within 
the unidentified sources. Our simulations indicate that ~ 40% 
should have detectable absorption. However, even for the full 



S. Mateos et al.: X-ray spectra of medium sources 



17 




20 21 22 23 24 25 1 2 3 

Log N^ ntr (cm -2 ) redshift 

Fig. 19. Left: Histograms of column density distributions in absorbed objects obtained from a simulated input distribution of N™ tr 
according to the model of Gilli et al. 120011 The dashed histogram represents the observed distribution for our sources. Right: 
N™ tr as a function of sources' redshifts from simulations. 



sample of 1 137 objects we only detect absorption in 10-20% of 
them. Assuming that the redshift distribution of the identified 
sources does not strongly deviate from the true one (and there 
is evidence that this is the case from other surveys), then 40% 
of absorbed sources predicted by the simulations should also 
be found in the whole sample, while only ~ 20% are seen. This 
confirms that XRB synthesis models predict twice as many ab- 
sorbed sources at our flux levels than we actually observe. 

8. Conclusions 

We have conducted a detailed analysis of the broad band spec- 
tral properties of a large sample of sources detected serendip- 
itously with the XMM-Newton observatory. Various spectral 
models have been tested to reproduce the 0.2-12 keV emission 
of our sources. We summarise our main conclusions as follows: 

1. Fitting the spectra of all the sources with a single power 
law we obtain a weighted mean of (F) = 1.86 + 0.02, in 
agreement with previous surveys. However, for a signifi- 
cant number of sources, the quality of their fits was very 
poor, with 7-12% of the sources having statistically unac- 
ceptable fits (null hypothesis probability < 5%). 

2. We observed the continuum average spectral slope to be- 
come harder with decreasing 0.5-2 keV flux. This effect 
does not appear to be correlated with the quality of the 
spectral data. After correcting for all the biases that could 
be affecting our results, we still see the same dependence, 
therefore the hardening of F with decreasing soft flux is 
real. 

3. Many objects in our sample exhibit absorption in excess 
of the Galactic value. In terms of the F-test (95% confi- 
dence level), excess absorption is found in 17% of sources. 
Significant fractions of X-ray absorbed objects were found 
among NELGs (~ 40%), BLAGNs (~ 6%) and ALGs (~ 
40%). This result clearly suggests that the relationship be- 
tween optical identification and X-ray spectral signatures is 
still unclear and needs some revision. 



4. The distributions of absorbing columns in BLAGNs and 
NELGs do not appear significantly different within the 
limitations of our sample, except for the overall factor 
that NELGs are 4 times more frequently absorbed than 
BLAGNs. 

5. The typical absorption columns derived for BLAGNs 
(Nh < 10 22 cm -2 ) indicate that perhaps the host galaxy, or 
some sort of weak distant absorber might provide the ab- 
sorbing gas, without blocking too much the line of sight to 
the Broad-Line region. This point deserves further explo- 
ration. 

6. Using the best fit model for each object, i.e., including ab- 
sorption and soft excess when required by the data, we still 
observe a substantial hardening of the average spectral in- 
dex towards fainter soft fluxes. The same effect was also ob- 
served for the sub-samples of BLAGNs and NELGs. With 
the current data quality, we cannot reject that these sources 
have the same underlying power law (T ~ 2) with moder- 
ate absorption, nor can we reject that there is a population 
with genuinely flatter spectral index. However when hard 
fluxes are used to study this effect there is no evidence of 
a change of average spectral slope. This supports the hy- 
pothesis that absorption is producing the hardening of the 
average spectra of our sources, because hard band fluxes 
are less affected by absorption. 

7. Using a maximum likelihood fit we have detected an in- 
trinsic dispersion of the spectral slope, even within the sub- 
samples of BLAGNs and NELGs, with the largest scatter- 
ing observed for the NELGs. 

8. We did not find a tendency of absorbed BLAGNs to occur 
preferentially at any particular redshifts. The same result is 
obtained when comparing the 2-10 keV luminosity distri- 
butions of absorbed and unabsorbed BLAGNs and NELGs. 
Therefore, no evidence is found of unabsorbed BLAGNs 
and NELGs having different X-ray luminosities to the ab- 
sorbed objects. 
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9. We have seen that our unabsorbed NELGs are on average 
less luminous than the unabsorbed BLAGNs and hence our 
data are consistent with the explanation of the lack of broad 
lines in the optical spectra of unabsorbed NELGs as an 
AGN/ host-galaxy contrast effect. 

10. A soft excess component was detected with a high signifi- 
cance in ~ 7% of BLAGNs and NELGs. 

11. For XRB synthesis models based on the AGN unified 
scheme, simulations were conducted to test the distribution 
of absorbing columns from these models on the population 
of BLAGNs and NELGs. In general, the number of sources 
where absorption was detected in the simulated spectra (in 
about ~ 40% of them) was a factor of ~ 2.5 above the 
measured values for all models. Regardless of the incom- 
pleteness of the identifications, the XRB synthesis model 
prediction of detectable absorption in 40% of the sources is 
at odds with the overall observed rate of absorbed sources 
in the real sample. 

12. The distribution of absorbing column densities predicted 
by the synthesis models and measured through the simula- 
tions is significantly skewed towards high column densities 
N™ 11 " ~ 10 22 - 10 24 crrT 2 . These highly absorbed sources, 
predicted by the XRB models at our flux levels, are not 
detected in our sample of objects. 
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Appendix A: Sources' detection efficiency 
function 

The use of different energy bands with different sensitivities to 
detect our sources, the selection of fields covering a wide range 
of exposure times (from ~ 10 to ~ 100 ksec), and the quality 
filters applied to the spectra (especially the selection of spectra 
with MOS+pn counts > 50) makes the calculation of the ef- 
ficiency of source detection as a function of flux difficult. We 
know it will be a function of the X-ray flux, but it is important 
to know whether it also depends on the spectral shape of the 
sources. 

We have conducted simulations to calculate the efficiency 
of detection, W(r, S): for a given number of sources with spec- 
tral parameters T and S, W(T, S) gives us the fraction of simu- 
lated objects that were detected, i.e., their spectra fulfilled the 
quality filters that we applied to our data, see Sec.[3j. and with 
best fit spectral parameters being the same as the input (sim- 
ulated) ones. This function can be used to correct observed 
source properties, for example the dependence of (T) with the 
X-ray flux, from all the effects listed above. We defined a grid 
of points in F-S covering the range of values measured for our 
objects (T from 0.5 to 3; S from 10~ 16 to 10 _12 ergcm _2 s _1 ). 

We have simulated the same number of spectra on each grid 
point. For these simulations we have used the list of detected 
sources before applying the quality filters, i.e., 2145 sources 
have been simulated for each point in the F-S grid. The input 
parameters of each simulated spectrum were the exposure time, 
response matrices and Galactic absorption of each real source, 
and the values of F and S at each grid point. We used model A 
(see Sec. 14. 1> to simulate all the spectra, therefore absorption 
is not included in the simulations. Once all the spectra were 
simulated, we applied the same quality filters that we used for 
our real data. Our source detection efficiency, at each grid point, 
was defined as 



W(T,S) = 



N del (r,s)+N det (r,s') 




Fig.A.l. Dependence of the detection efficiency of sources 
as a function of F at different flux levels, obtained from sim- 
ulations. Some values of the efficiency function can be above 
100% because the fitted parameters may differ from the input 
ones (see text for details). 
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Fig. A.2. Representation of the average movements in the F-S 
plane obtained from simulations. The beginning point of each 
arrow indicates the input values of T and S in the simulations; 
the end points are the observed (fitted) average values of F and 
S for the detected sources. 

It is important to notice that, there are two possible contribu- 
tions to the final number of sources that will appear at a given 
grid point (r, S): 

1. The number of simulated sources at the grid point (F, S) 
that, after fitting, remain at the same grid point, N de t(F, S). 

2. The number of sources simulated at different grid points 
(T' , S' ) that were detected, and with best fit spectral param- 
eters (r, s), N det (r, s'). 



where N S i„, is the number of sources that were simulated at the 
grid point (F, S) (this number is the same for all the grid points). 



The values of the detection efficiency as a function of T 
are plotted in Fig. IA.1I for different fluxes. We observe that, 
at fluxes below ~ 10~ 14 erg crrT 2 s -1 , the efficiency of detec- 
tion decreases significantly from F=0.5 to F=2, and then, starts 
to increase up to F=3. The rise of efficiency at high values 
of F could be due to the sharp increase in the effective areas 
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of the EPIC detectors at low energies combined with the in- 
crease in the relative contribution to the total counts at these 
energies for the steeper spectral slopes. It is interesting to no- 
tice that, although we have shown that the detection efficiency 
increases for T values above 2.5, we have not detected these 
super-soft sources in our sample, hence, they must be rare. The 
reason why the efficiency function is higher for objects with 
hard (F ~0.5) spectral slopes (notice that the effective area of 
the X-ray detectors drops rapidly at energies below ~1 keV) is 
that a source with a flat spectral slope has to be much brighter 
than one with a soft (T ~2) spectral slope in order to have the 
same 0.5-2 keV flux. Therefore we will receive more counts in 
the 0.2-12 keV band from flatter objects, which means that it 
will better detect them and more easily pass our spectral quality 
filters. 

Our simulations showed that there are movements of ob- 
jects in the T-S plane, i.e., the fitted parameters may differ from 
the input ones. It is necessary to study how important this effect 
is and whether there are systematic shifts (for example a ten- 
dency to detect the sources with lower values of gamma at faint 
fluxes). Hence, we have calculated for each grid point the av- 
erage values of T and S, in the same way as we did for the real 
sample, from all the sources simulated at that point that were 
detected (no matter at which grid point they were detected). 
The results, shown in Fig. IA.2l confirm the existence of shifts 
in the T-S plane at the faintest fluxes: on average, the measured 
T values tend to be softer, and hence the observed 0.5-2 keV 
fluxes are lower. However, for fluxes > 10~ 15 erg cirT 2 s _1 these 
movements are not significant and tend to soften than harden 
spectra. 



